Ensembles of Classifiers from Spatially Disjoint Data
نویسندگان
چکیده
We describe an ensemble learning approach that accurately learns from data that has been partitioned according to the arbitrary spatial requirements of a large-scale simulation wherein classifiers may be trained only on the data local to a given partition. As a result, the class statistics can vary from partition to partition; some classes may even be missing from some partitions. In order to learn from such data, we combine a fast ensemble learning algorithm with Bayesian decision theory to generate an accurate predictive model of the simulation data. Results from a simulation of an impactor bar crushing a storage canister and from region recognition in face images show that regions of interest are successfully identified.
منابع مشابه
Creating Ensembles of Classifiers
Ensembles of classifiers offer promise in increasing overall classification accuracy. The availability of extremely large datasets has opened avenues for application of distributed and/or parallel learning to efficiently learn models of them. In this paper, distributed learning is done by training classifiers on disjoint subsets of the data. We examine a random partitioning method to create dis...
متن کاملEnsembles of data-reduction-based classifiers for distributed learning from very large data sets
An ensemble of classifiers is a set of classification models and a method to combine their predictions into a joint decision. They were primarily devised to improve classification accuracies over individual classifiers. However, the growing need for learning from very large data sets has opened new application areas for this strategy. According to this approach, new ensembles of classifiers hav...
متن کاملUsing classifier ensembles to label spatially disjoint data
We describe an ensemble approach to learning from arbitrarily partitioned data. The partitioning comes from the distributed processing requirements of a large scale simulation. The volume of the data is such that classifiers can train only on data local to a given partition. As a result of the partition reflecting the needs of the simulation, the class statistics can vary from partition to part...
متن کاملMeta-ensembles of Classifiers for Alzheimer's Disease Detection Using Independent ROI Features
Due to its growing social impact, prodromal detection of Alzheimer’s disease is of paramount importance. Biomarkers based on Magnetic Resonance Imaging (MRI) are one of the most sought results in the neuroscience community. In this paper we evaluate several ensembles of classifiers trained and tested in a two level ensemble scheme as follows: the 116 regions of interest (ROI) of the Anatomical ...
متن کاملAuthorship Attribution Based on Feature Set Subspacing Ensembles
Authorship attribution can assist the criminal investigation procedure as well as cybercrime analysis. This task can be viewed as a single-label multi-class text categorization problem. Given that the style of a text can be represented as mere word frequencies selected in a language-independent method, suitable machine learning techniques able to deal with high dimensional feature spaces and sp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005